AITopics | function approximator

Collaborating Authors

function approximator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Transfer in reinforcement learning aims at solving a new target task with no additional learning or sample-efficiently by exploiting agents and information obtained from source tasks. We review a line of research with relevant approaches. This group of approaches reuses policies learned on source tasks for target tasks. Fernández and Veloso [17] suggest an exploration strategy for the learning of a new policy given a new task and learned source policies, where the gain of using each policy is estimated together on-line and one of the policies in the set is selected probabilistically at each step, based on the gain, but they focus on aiding the training of the target policy with samples from the target task rather than improving the zero-shot transfer performance. On the other hand, Dayan [14] introduce successor representations (SRs), state space occupancy representations disentangled from rewards, which allow linear decomposition of value functions.

large language model, machine learning, target task, (21 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.36)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.30)

Add feedback

Learning Pipelines with Limited Data and Domain Knowledge: A Study in Parsing Physics Problems

Mrinmaya Sachan, Kumar Avinava Dubey, Tom M. Mitchell, Dan Roth, Eric P. Xing

Neural Information Processing SystemsFeb-14-2026, 03:06:05 GMT

Neural Information Processing Systems http://nips.cc/

data mining, logic & formal reasoning, machine learning, (23 more...)

Neural Information Processing Systems

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > Quebec > Montreal (0.04)
North America > Canada > Quebec > Capitale-Nationale Region > Québec (0.04)
(4 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (0.69)
(4 more...)

Add feedback

227e072d131ba77451d8f27ab9afdfb7-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-11-2026, 16:52:51 GMT

action-value function, convergence, neural network, (14 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

7eb7eabbe9bd03c2fc99881d04da9cbd-Paper.pdf

Neural Information Processing SystemsFeb-9-2026, 13:06:03 GMT

intervention, interventional distribution, ispn, (14 more...)

Neural Information Processing Systems

Country:

Europe > Germany > Hesse > Darmstadt Region > Darmstadt (0.05)
Asia (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Texas (0.04)

Genre: Research Report (0.46)

Industry:

Education (0.68)
Health & Medicine > Therapeutic Area > Oncology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)

Add feedback

6ba3af5d7b2790e73f0de32e5c8c1798-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-8-2026, 19:02:01 GMT

artificial intelligence, machine learning, value function, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.51)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.32)

Add feedback

BooVI: ProvablyEfficientBootstrappedValue Iteration

Neural Information Processing SystemsFeb-8-2026, 06:16:49 GMT

In this paper, we develop a variant of bootstrapped LSVI,namely BooVI, which bridges such agapbetween practice andtheory.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.94)

Add feedback

Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents

Neural Information Processing SystemsDec-26-2025, 12:21:23 GMT

Deep reinforcement learning (RL) has achieved remarkable success in solving complex tasks through its integration with deep neural networks (DNNs) as function approximators. However, the reliance on DNNs has introduced a new challenge called primacy bias, whereby these function approximators tend to prioritize early experiences, leading to overfitting. To alleviate this bias, a reset method has been proposed, which involves periodic resets of a portion or the entirety of a deep RL agent while preserving the replay buffer. However, the use of this method can result in performance collapses after executing the reset, raising concerns from the perspective of safe RL and regret minimization. In this paper, we propose a novel reset-based method that leverages deep ensemble learning to address the limitations of the vanilla reset method and enhance sample efficiency. The effectiveness of the proposed method is validated through various experiments including those in the domain of safe RL. Numerical results demonstrate its potential for real-world applications requiring high sample efficiency and safety considerations.

name change, reset deep ensemble agent, safe deep reinforcement learning, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

Neural Information Processing SystemsDec-24-2025, 19:18:21 GMT

Temporal-difference and Q-learning play a key role in deep reinforcement learning, where they are empowered by expressive nonlinear function approximators such as neural networks. At the core of their empirical successes is the learned feature representation, which embeds rich observations, e.g., images and texts, into the latent space that encodes semantic structures. Meanwhile, the evolution of such a feature representation is crucial to the convergence of temporal-difference and Q-learning. In particular, temporal-difference learning converges when the function approximator is linear in a feature representation, which is fixed throughout learning, and possibly diverges otherwise. We aim to answer the following questions: When the function approximator is a neural network, how does the associated feature representation evolve?

feature representation, name change, temporal-difference and q-learning learn representation, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

BooVI: Provably Efficient Bootstrapped Value Iteration

Neural Information Processing SystemsDec-24-2025, 00:09:05 GMT

Despite the tremendous success of reinforcement learning (RL) with function approximation, efficient exploration remains a significant challenge, both practically and theoretically. In particular, existing theoretically grounded RL algorithms based on upper confidence bounds (UCBs), such as optimistic least-squares value iteration (LSVI), are often incompatible with practically powerful function approximators, such as neural networks. In this paper, we develop a variant of \underline{boo}tstrapped LS\underline{VI}, namely BooVI, which bridges such a gap between practice and theory.

boovi, name change, provably efficient bootstrapped value iteration, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.61)

Add feedback

Filters

Collaborating Authors

function approximator

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

ARelated Work

Learning Pipelines with Limited Data and Domain Knowledge: A Study in Parsing Physics Problems

227e072d131ba77451d8f27ab9afdfb7-AuthorFeedback.pdf

e3bc4e7f243ebc05d66a0568a3331966-Paper.pdf

7eb7eabbe9bd03c2fc99881d04da9cbd-Paper.pdf

6ba3af5d7b2790e73f0de32e5c8c1798-AuthorFeedback.pdf

BooVI: ProvablyEfficientBootstrappedValue Iteration

Sample-Efficient and Safe Deep Reinforcement Learning via Reset Deep Ensemble Agents

Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

BooVI: Provably Efficient Bootstrapped Value Iteration